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© IMAGE DATA CODING METHOD, IMAGE DATA DECOO.NQ . ^^Oa .MAGE DATA COO.NG DEVICE, 
IMAGE DATA DECOOING DEVICE, AND IMAGE RECORDING MEDIUM. 

field and an even-numbered Held. A circuit (2!) determmes the cod.ng ^ e ^^*^™ H5 17) and 
every frame), according to the median or the motion vectors A control arcu. ^^^^^ of 
switches (6. 9. 10. 15) corresponding to the determined cod.ng mode, and ,t generates ^a P'^tove 9 
every field or frame from buffer memories (7. 3). Circuits from a J^^^^^^ 1 ^ 
calculate the difference data between an image to be coded and l ,he pred cdve ™ ^ VLC . 

difference data to discrete cosine transformation, and thereafter f^^. w '^^?^^ t 
circuit (25) sets the cooing mode as a Hag in the header of the predetermined h.erarchy of a b.t stream. 
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TECHNICAL FIELD 

This invention relates to an encoding method and an encoding device for high efficiency encoding of 
picture signals by orthogonal transform, a picture data decoding method and a picture data decoding device 
for decoding picture data resulting from high efficiency encoding, and a picture recording medium. 

BACKGROUND ART 

According to a draft for standardization for the high efficiency encocing system for picture signals, as 
proposed by the Moving Picture Expens Group (MPEG), a nigh efficiency =ncccir.g system , : cr ;;c;-re 
signals for a so-called digital storage medium is prescribed. The foiiowing is the principie of tne hign 
efficiency encodina system by MPEG. 

That is with this high efficiency encoding system, a difference is first taken between pictures to lower 
the redundancy along the time scale. Subsequently, discrete cosine transform (OCT) and variable length 
coding (VLC) are carried out to lower the redundancy along the spatial axis. 

The redundancy along the time scale is first explained. 

In general in consecutive moving pictures, a picture under consideration, that is a picture at a given 
time point bears strong similarity to temporally previous and temporally succeeding pictures. Consequently, 
by taking a difference between a picture now to be encoded and a temporally forward p.cture. ana by 
transmitting the difference, as shown in Fig.1. it becomes possible to_diminish the redundancy along the 
time scale and hence the amount of the information to be transmitted, i he picture encoded in this manner 
is termed a predictive-coded picture. P-picture or a P-frame. as later explained. 

Similarly by taking a difference between a picture now to be encoced and a temporally forward p.cture. 
a temoorally backward picture or a interpolated picture produced from the temporally forward and 
s temoorally backward pictures, and transmitting the smallest of the differences, it becomes poss.ble to 
diminish the redundancy along the time scale and hence the amount of the information to be transmitted. 
The oicture encoded in this manner is termed a bidirectional^ predictive-coded picture. 8-p,cture or a B- 
frame as later explained. In Fig.l. a picture indicated by I is an intra-ccced picture as later explained, while 
pictures indicated by P and B - in the figure are the above-mentioned P-picture and the 8-p.cture. 

o resoectively. . 

' For producing prediction pictures, so-called motion compensation is performed. According to the motion 
compensation a 16 x 16 pixel block, referred to herein as a macro-block, made up of unit blocks each 
consisting of 8 x 8 pixels, is prepared, one of those macro-blocks of the previous picture which ,s in the 
vicinity of a macro-block under consideration and has the smallest of the differences is retrieved and a 

,s difference between the macro-block under consideration and the macro-block thus retr.eved is taken to 
diminish the volume of data to be transmitted. For example, with the above-mentioned pred.ct.ve-coded 
picture or P-picture. one of picture data produced by taking a difference between the prediction p.cture and 
the motion-compensated prediction picture and picture data produced by not taking a d.fference between 
the-prediction picture and the motion-compensated prediction picture, whichever is smaller .n data volume. 

io is selected and encoded on the basis of the 16 x 16 pixel macro-block as a unit. 

However in such case, a larger amount of data need to be transmitted for a picture portion wn.ch has 
aopeared from behind a moving object. In this consideration, with the above-mentioned b.d.rect.onally 
coded picture or 8-picture. one of picture data corresponding to the difference between the p.cture data 
now to be encoded and the decoded and motion-compensated temporally forward picture data, p.cture data 

,s corresponding to the difference between the picture data now to be encoded and the decoded and motion- 
compensated temporally backward picture data, picture data corresponding to the difference between the 
picture data now to be encoded and interpolated picture data prepared by adding the decoded and motion- 
compensated temporally backward and temporally forv/ard picture data and the picture data for which the 
difference has not been taken, that is the picture now to be encoded, whichever has the smallest data 

so volume, is encoded. 

The redundancy along the spatial axis is hereinafter explained. 

The difference of the picture data is not transmitted directly, but is processed with discrete cosine 
transform (DCT) from one 8 x 8 pixel unit block to another. The OCT represents a picture depending on 
which frequency components of a cosine function are contained in a picture and in which amounts these 
55 frequency components are contained, instead of on the pixel level. For example, by two-dimens.onal DCT. 
picture data of the 8 x 8 unit block is transformed into a 8 x 8 coefficient block of the components of the 
cosine function. For examole. picture signals of a natural scene as imaged by a television camera 
frequently represent smooth signals. In such case, the picture data volume may be efficiently d.m.n.shed by 



EP 0 573 665 A1 



processing the picture signals with OCT. 

The data structure handled by the above-mentioned encoding system is shown in Fig. 2. The data 
structure shown in Fig.2 includes, from the lower end on, a block layer, a macro-block layer, a slice layer, a 
picture layer, a group-of-picture (GOP) layer and a video sequence layer. The data structure is now 
explained, from the lower layer on, by referring to Fig.2. 

First, as to the block layer, each block of the block layer is composed of 8 x 8 neighboring pixels, that 
is 8 pixels of 8 lines, of luminance or color difference. The above-mentioned OCT is performed for each of 
these unit blocks. 

The macro-blocks of the macro-block layer are mace up of left and right upper and lower four 
neighboring luminance blocks or unit luminance blocks YO. Y1, Y2 and Y3 and color difference blocks or 
unit color difference blocks Cr, Cb. which are at ;he same positions on the picture as the luminance blocks, 
as shown at E in Fig.2. These blocks are transmitted in the sequence of YO, Y1 , Y2. Y3. Cr and Cb. Which 
picture is used as a prediction picture, that is a reference picture of difference taking, or whether or not a 
difference need not be transmitted, is decided on the macro-block basis. 

The above-mentioned slice layer is made up of one or more macro-blocks arrayed in the picture 
scanning sequence, as shown at D in Fig.2. At a header of the slice, the difference of the dc component 
and the motion vector in the picture are reset. On the other hand, the first macro-block has data indicating 
the position within the picture, such that reversion may be made in case of an error occurrence. 
Consequently, the length or the starting position of the slice are arbitrary and may be changed depending 
on the error state on the transmission channel. 

As to the picture layer, each picture is made up of at least one or plural slice(s), as shown at C in Fig.2. 
Each picture may be classified into the intra-coded picture (l-picture or l-frame), the predictive-coded 
picture (P-picture or P-frame), bidirectionally coded picture (B-picture or B-frame) and the OC intra-coded 
picture (DC coded (D) picture). 

It is noted that, for encoding the intra-coded picture or l-picture, only the information which is closed 
within each picture is employed. In other words, for decoding the l-picture. only the information contained in 
the picture concerned is employed. In effect, or encoding a picture by intra-coding, the picture is directly 
discrete cosine transformed without taking a difference. Although this encoding system usually has a poor 
efficiency, random accessing or high-speed reproduction may be enabled by inserting this picture at 
arbitrary places. 

As to the predictive-coded picture (P-picture), the l-picture or the P-picture positioned temporally 
previously at an input and already decoded is employed as a prediction picture (reference picture in 
difference taking). In effect, encoding the difference between the prediction picture and the motion- 
compensated prediction picture or directly encoding the prediction picture, that is without taking the 
difference, whichever is more efficient, is selected on the macro-block basis. 

As to the bidirectionally coded picture or 3-picture, three types of pictures, namely a temporally 
previously positioned and already decoded l-picture, a temporally previously positioned and already 
decoded P-picture and an interpolated picture obtained from these pictures, are employed as a prediction 
picture. In this manner, encoding of the difference between the prediction picture and the motion- 
compensated picture and the intra-coding, whichever is more efficient, may be selected on the macro-block 
basis. 

The DC intra coded picture is the intra-coded picture which is made up only of DC coefficients in OCT 
and which cannot exist in the same sequence as the remaining three pictures. 

The GOP layer is made up of one or plural l-picture(s) and zero or plural non-l-pictures, as shown at 8 
in Fig.2. The distance between the l-pictures, such as 9, or the distance between the l-pictures or the 8- 
pictures, such as 3. is arbitrary. Besides, the distance between the l-pictures or between the 8-pictures may 
be changed within the inside of the GOP layer. 

The video sequence layer is made up of one or plural GOP layer(s) having the same picture size or the 
same picture rate, as shown at A in Fig.2. 

For transmitting the moving picture standardized in accordance with the high efficiency encoding 
system by MPEG, as described above, picture data produced by compressing a picture in itself are 
transmitted, and subsequently a difference between the picture and the motion-compensated same picture 
is transmitted. 

However, the following problem has been found to be raised when the picture to be encoded is an 
interlaced picture resulting from an interlaced scanning. 

That is, if a picture resulting from the interlaced scanning is encoded on the field-by-field basis, a 
difference in the vertical positions is alternately incurred from field to field. Consequently, when transmitting 
a stationary portion of a moving picture, difference data is produced at a boundary between the fields, 
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notwithstanding that the picture portion remains stationary. Since the difference data needs be transmitted, 
the encoding efficiency is lowered at the stationary portion of the moving picture. 

Also, if a picture produced by interlaced scanning is encoded on the field-by-field bas.s. s.nce each 
block is formed on the field-by-field basis, the interval between pictures becomes broader than if the block 
is formed on the frame-by-frame basis, with the result that correlation and hence the coding efficiency are 
lowered. 

On the other hand, if the picture resulting from interlaced scanning is encoded on the frame-by-frame 
basis, the moving cortion in the frame is blurred in the shape of a comb. For example, if a moving object, 
such as a motor vehicle, is present ahead of a stationary object, the motor vehicle, which is the moving 
oonion. becomes olurrea wnen viewea as a frame, as indicated at KS in Fig.4. because of motion between 
fields. The result is that high-frequency components, not present in the original picture, are transmitted, thus 
lowering the encoding efficiency. 

Besides with the frame-by-frame encoding, since encoding is made on the basis of two consecutive 
fields making up a frame as a unit, predictive coding cannot be employed between the two consecutive 
fields. Thus the minimum distance of predictive coding becomes a frame or two fields. Consequently, as 
compared to the field-by-field coding with the minimum predictive coding distance of one field, the frame- 
by-frame encoding is disadvantageous in respect to a picture having a fast or intricate motion. 

As discussed in the foregoing, there are occasions wherein the field-by-field encoding is lowered in 
encoding efficiency with corresponding rise in encoding efficiency of the frame-by-frame encoding, or 
wherein the frame-by-frame encoding is lowered in encoding efficiency with corresponding rise in encoding 
efficiency of the field-by-field encoding. 

In view of the above-described status of the art. it is an object of the present invention to prov.de a 
picture data encoding method and a picture data encoding device whereby a picture produced by 
interlaced scanning may be encoded efficiently whether the picture is replete with motion, the p.cture shows 
only little motion or the picture replete with motion and the picture showing only little motion co-exist, and a 
picture data decoding method and a picture data decoding device for decoding picture data encoded by the 
encoding method and the encoding device. 

DISCLOSURE OF THE INVENTION 

A first picture data encoding method for encoding interlaced picture signals according to the present 
invention comprises deciding if encoding of picture signals is to be executed on the field-by-field bas.s or 
on the frame-by-frame basis, encoding the picture signals on the field-by-field basis or on the frame-by- 
frame basis, depending on the results of decision, for generating encoded picture data, and appending 
discrimination data to the encoded picture data based on the results of decision, the discr.mmation data 
indicating if the encoded picture data have been encoded on the field-by-field basis or on the frame-by- 
frame basis. 

According to a second picture data encoding method for encoding interlaced picture signals according 
to the present invention, encoded picture data has a hierarchical structure and the discriminating data is 
provided at a portion of a header of a predetermined hierarchical layer of the encoded picture data. 

According to a third picture data encoding method for encoding interlaced picture signals according to 
the present invention, the decision in the first picture data encoding method is executed based on the 
motion vector of each macro-block. 

A fourth picture data encoding method for encoding interlaced picture signals according to the present 
invention comprises encoding picture signals on the field-by-field basis for generating encoded p.cture data, 
and appending discrimination data indicating that the encoded picture data have been encoded on the f.eld- 
by-field basis to the encoded picture data. 

According to a fifth picture data encoding method for encoding picture signals according to the present 
invention, encoded picture data has a hierarchical structure in the fourth picture data encoding method and 
the discriminating data is provided at a portion of a header of a predetermined hierarchical layer of the 
encoded picture data. 

A sixth picture data encoding method for encoding interlaced picture signals according to the present 
invention comprises encoding the picture signals on the frame-by-frame basis for generating encoded 
picture data, and appending discrimination data indicating that the encoded picture data have been encoded 
on the frame-by-frame basis to said encoded picture data. 

According to a seventh picture data encoding method tor encoding picture signals according to the 
present invention, encoded picture data has a hierarchical structure in the sixth picture data encoding 
method, and the discriminating data is provided at a portion of a header of a predetermined hierarchical 
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Fio 25 f S h0 a W Mo he TT» °' PiCtUreS Sntered t0 PiCtUfe d3ta enCOdin 9 device ^ i" F'9-23 

is a flow chart illustrating an algorithm of an encoding system decision circuit mnZit,,, ™ > h 
p.cture data encoding device shown in Fig.23. aec.s.on c.rcu.t constitut.ng the 

Fig.26 shows a concrete format of a header od encoded picture data 
acco^n 2 g^o^h a eXS:t i r VVin9 3 C ° nCrete drCUit C ° nStrUCti0n ^ 3 d3ta d — ^ "V- 

the p-S ":^ZZ^^T drCU,t COnStrUCti ° n ° f " mVSrSS VLC 
present?nvention. ,OW ^ illUStratm9 ^ ° Perati ° n °' ^ PtetUr9 ^ deC ° d ^ device according to :he 

■-ig.30 is a flow chart illustrating the operation of the picture data decoding devic= accorfinn •» -ho 
present invention. y ue/| c- according t o ihe 

BEST MODE FOR CARRYING OUT THE INVENTION 

By referring to the drawings, concrete embodiments of the present invention will be explained in detail. 
1. Picture Data Encoding method/Picture Data Decoding Method 

will b T e h txplLi n ed. field "* frame ' by -' rame -coding/decoding operations according to the present invention 

fi e .d'U h lH PiCtUre Sh r n in Fi9 " 1 ,S 3 field - b y- fie,d P^ture. it may be represented as shown in Fig 5 if its 

nZtrZ rZ S , 0n , ,S t3ken int ° aCC ° Unt - ' n Fi9 ' 5 - an upper raw denotes • ««t field, such as an od£ 
numbered held, and a .ower row a second field, such as an even-numbered field. Two field temooralt 
accent each other at an interval of 1/60 sec make up a frame. With the field-by-field encodfngTd'codtna 
operation, each picture is encoded/decoded on the field-by-field basis encoa.ng/decoding 

,nr,L C ,° nCre ^ ! Xamp ' e ^ Whi ° h 3 constructi ° na ' Pa«ern of the l-oicture (intra-coded picture) / P-oicture 
c«ZZ , P ' C c r6> ' B -P lct - e (bidirectiona,ly coded picture) in the GOP layer sholT n Fiq 2B s 
i 5£ ed whTe^os s'fn^ 95 - 5 * ^ ** ^em of the pictures in the GOP ay 

hi kIm h f 9 ^ COmmon ,n carr y |n 9 ou t the encoding/decoding operations on the field- 

by-he.d bas.s. Meanwhile, if the encoding type is the same between the first and the second fields a J in 
F.g 6. the f,rst and the second fie.ds may be encoded collectively, in which case the operation is the Irame 
by-frame encoding/decoding operation shown in Fig.7. operation ,s the frame- 

*«J!r V /w ietV . 0f m0ti ° n prediction and compensation operations may be implemented for th P 

To ,c 2". 6 9UfeS - f ° r exam P' e - in F '9S-8 and 9, a thick broken-lined arrows from oicture 12 

to p cture 15 and from picture ,3 to picture P6. indicate motion prediction to P-pictures On thither hand 

ngs r :^e°Tond tj to pictur : 80 in Fi9s - 8 and 9 indicate motion « ^^z^t 

a " d 10B - so,ld - ,ir1 ed arrows indicate motion prediction of the frame-based macro-blocks while 
broken-lined arrows mdicate motion prediction of the field-based macro-blocks 

* m „r th the . field -°y- field encoding/decoding, the P-picture. such as the P-oicture P5 shown in Fio fi 

p"ture y Vat P s CtUre f ' 2 - * previ ° US ' y and decoded asTp edSon 

picture, that is a reference p.cture for difference taking, while the picture P8 employs a oicture ps X 

F°Trm Dl temP ° ra,ly PfeVi0USly 3nd 3lre3dy deCOded ' 35 3 P^don Picture' The oc u rP6 s^: n 
F,g.9 employs a picture P6 as a prediction picture, whi.e the picture P7 employs a picture P6 as a 
pred,ct,on picture. The 8-picture, such as the picture B4 shown in Fig.8. emp.oys three SuTes that is 
picture .2 p.cture P5 and an interpolated picture of these pictures, as a prediction picture Bo Su e 
sucn as the picture 84 shown in Fig.9, empioys three pictures, that is picture 13 picture P6 and an 
interpolated p.cture produced from both of these pictures, as a prediction p.cture 

such as pictures X^nTp^T^T ^^ecoding, a frame made up of P-pictures or frames, 
sucn as pictures P6 and P7 shown ,n Fig. 1 0A, employs a frame made up of pictures 12 and 13 as * 

P6 anrpV'^r' "Tr* ^ ^ UP ° f P ' CtUreS Pl ° 3nd Pl 1 emp, °* s 3 < ra ™ m*e up^f pictu es 
PS and P7 as a pred.ct.on p.cture. On the other hand, a frame made up of 8-pictures or frames s ^h It 

^ 1 , P0S 'i . temp0r3 "y P revious, V and a '^^y decoded, a frame made up of pictures P6 and P7 
and an interpolated picture produced from both of these pictures, as a prediction picture 

As discussed ,n the foregoing, the field-by-field encoding/decoding operations and 'the fram* h„ i, ama 
encoding/decoding operations are common in the encoding/decoding ^r^ ^w!": 
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explained subsequently, they differ in the block constitution and motion prediction/compensation. 
(1) Block Constitution 

Since the first and second fields are collectively encoded and decoded with the frame-by-frame 
9 ncSCSd!Udino operations, it is possible to combine the first and second fields ,ntc -a block. On lh e 
other hand, with the fie.d-by-neld processing, a block may be consulted by any one of ,hese fields. 

i2) Mocion Preaiction/Comcensation 

With frame-by-frame encoding/decoding operations, the encoding/decoding operations are Permed 
with the first and second fields combined together, so that motion pred.ct.on towards the secono . eld 
belonging to Te and the same frame ,s not employed. However, with fie,d-by-fie.d process.ng. motion 
orediction r'rom the first field to the second field is employee . 

Oetans or the above paragraphs <i> Block Constitution and (2) Motion Pred.ct.on/Comp.ns.don are 

explained. 

(1 ) Block Constitution 

Fig 1 1 shows the constitution of inner blocks of the macro-block in the fie.d-by-Meld s ^y^°^ 
ooera'ons. As shown in Rg.il. with the field-by-field encoding/decoding operates, the ne.d-by-h.id 
m^rrn-hlock is constituted solely by the first fieid or by the second held. 

On hfo^her hand Fio.12 Ihows the constitution of inner blocks of the macro-block in the framed-by- 
frame ercod-ng/decoding ooerations. With the framed-by-frame encoding/decoding operates a frame-oy- 
frame m^^Hhown in Fig. l2 C may be employed in addition to the field-by-fie.d macro-blocks shown 
n Fios 12A ^ and 128 That is. with the field-by-fie.d macro-block, it is poss.b.e to divide a macro-o.ock ,nto 
u P p?r and lower two blocks, with the upper two b.ocks being constituted so.e.y by the first fields and , he 
"ower t^o blocks being constituted so.e.y by the second fields, in addition to the first and second fie.d^ o 
he ma^o-btock construction which are the same as in the field-by-fie.d encod.ng/decod.ng operations 
ZZ Z fZ " 1. On the other hand, the frame-by-frame macro-block is constituted by nrst and second 

^''fn "ifmln^ef with'L frame-by-frame encoding.deccd.ng operations, a frame-by-frame macroWk 
may be employed in addition to the fie.d-by-fie.d macro-block in the field-by-fie.d encod.ng/decod.ng 

0Pe Mean S whi.e a switching between the fie.d-by-fie.d macro-block and the frame-by-frame macro-block as 
described r! the foregoing may be realized by controlling readout addresses at the time of read.ng out 
bSS from buffer memores 7 and 8 based on the discrimination information for discr.m.nat.on between the 
^d by S encoding and the frame-by-frame encoding, as determined by an encod.ng system deasjon 
drcui^ 2 Fig 23) of the oicture data encoding device as later explained. With a p.cture data decod.ng 
ZZe as later explained, such switching may also be realized by detecting a flag recorded or superposed 
fn Tn encoded bU stream received by a p.cture data encoding device, etc. and dead, ng whether he 
encoding system is a field-by-field encoding system or a frame-by-frame encoding system, based on 
detected hig by an inverse variable length coding circuit 31 (Fig.27). and transm.tt.ng the correspo^ng 
fnformaln I motion compensating circuits 42. 43 etc. for controlling the readout addresses of the ouffer 

JS mem ThaHs^he 8 buffer memories 7 3. 37. 38 are constituted by memories each having a storage capacity of 
480 ^ W I, sZ H ^ Figs.13 and 14. If the encoding system is designed on the frame-by-rrame 
basis fctu e dL ie stored on the frame-by-frame basis and. if the encoding system is ^designee or, . t he 

- 0 ^r^eSr b^emporr^^^ 

cSadty to a Sme Ire is no limitation to the memory capacity, and may be larger in -paoty tha * .the 

eTcX/decoding "operations, there is no necessity of employing two buffer memor.es wh.ch may be 
55 combined into a sole buffer memory. 



35 



-JO 



9 



EP 0 573 665 A1 



• 



(2) Motion Prediction/Compensanon 



With the field-by-field encccing/decoding operations, motion prediction from a first field to a second 
field of one and the same frame, such as prediction from picture 12 to picture B3 as shown in Fig 8 or 
5 prediction from picture P6 to picture 97 shown in Fig.9, may be employed in effecting motion 
prediction/compensation. 

However, with frame-based encoding/decoding, since two fields are encoded/decoded collectively, as 
shown in Fig.7, motion prediction from a rirst field to a second field belonging to the same frame as the first 
field is not employed. 

■o As discussea in ;he foregone, since :ne motion orediction of the field-bv-field encoding/decoding 

operation makes use of the mener, precision from ihe first field to the second field of one ana" the same 
frame, the minimum distance between the pictures to be motion-predicted is diminished such that motion 
prediction comprises motion precision of the frame-by-frame encoding/decoding operations. Although the 
motion prediction of the frame-cased macro-block is shown in the above-described concrete embodiment 

is the same motion prediction is used for two fields within the macro-block so that two motion predictive 
operations of the field-based macro-blocks may be used in substitution. Besides, the frame-by-frame motion 
prediction is not the motion precision indispensable to the frame-by-frame encoding/decoding ooerations. 
and only the field-by-field motion prediction suffices. 

In this manner, with the encoding/decoding method for pictures produced by interlaced scanning, 

20 according to the present invention, the encoding/decoding operations by both the field-by-field 
encoding/decoding method and the frame-by-frame encoding method are possible by associating the block 
constitutions and the method for controlling the motion prediction with the two encoding/decoding oper- 
ations. 

In such case, it is necessary :o transmit to the picture data decoding device data indicating which of the 
25 field-by-field encoding operation and the frame-by-frame encoding operations has been carried out, and in 
which range the encoding operation has been carried out. 

To this end, according to the oresent invention, a flag indicating whether a range of an encoded picture 
has been processed on the field-by-field basis or on the frame-by-frame basis is provided at a portion of an 
encoded bit stream, that is enccced picture data. Meanwhile, this range means, for example, a sequence, 

30 GOP or a picture. Concretely, with the picture data encoding device, discrimination data for discriminating 
whether a range of a picture has been encoded on the field-by-field basis or on the frame-by-frame basis is 
set or superposed at a predetermined position of the encoded bit stream. The picture data decoding device 
deciphers such predetermined position of the encoded bit stream by the inverse variable fength encoding 
circuit 31 as described above for determining the type of the unit of the decoding operation to perform a 

as decoding operation accordingly for picture reproduction. 

It should be noted that the motion prediction in the field-based and frame-based encoding/decoding 
operations is not limited to the concrete embodiment shown in Figs.8 to 10. For example, in the frame- 
based encoding/decoding operations employing the motion prediction shown in Figs. 15 and 16. switching 
may be made for each hierarchical layer of the picture and the frame-based encoding/decoding operations. 

Specifically, Fig.iSA shows a concrete example different from the P-picture predicting method for fieid- 
by-field encoding/decoding operations, shown in Fig.8 f in the constructional pattern of the l/P/B picture in 
the GOP layer shown in Fig.5. Ho. 158 shows a concrete example different from the B-picture predictive 
method of the field-by-field encooing/decoding operations. That is, a picture P5 is used in the concrete 
example of Fig.8 as a prediction picture for picture P8. while a picture such as a picture 12 may also be 

45 employed as shown in Hg.15A. On the other hand, while three pictures, namely the picture 12. picture PS 
and an interpolated picture of these two pictures, are used in the concrete example of Fig.8 as a prediction 
picture for picture 84, pictures 12. PS, P8. Pi 1 and interpolated pictures of these pictures, may also bee 
employed, as shown in Fig.lSB. 

Fig.l6A shows a concrete example different from the predictive method for the B-picture in the field- 

50 based encoding/decoding operations shown in Fig.9. while Fig. 168 shows a concrete example different from 
the predictive method for the B-picture in the field-based encoding/decoding operations shown in Fig.9. 
That is, while picture 13 is employed in the concrete example shown in Fig.9, picture 12, for example, may 
also be employed, as shown in Fig.i6A. Besides, while three pictures, namely pictures 13 and P6 and an 
interpolated picture of these two pictures, are employed in the concrete example of Fig 9 as a prediction 

55 picture for picture 84, pictures !2. 13, P6. P7 and interpolated pictures of these pictures, may also be 
employed, as shown in Fig. 168. 

In addition, the field-based encoding/decoding operations may also be combined with the frame-based 
encoding/decoding operations. 
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Fqs 17 and 20 illustrate concrete examples in which a plurality of pictures are encoded/decoded on the 
frame-by-frame basis and next following pictures are subsequently encoded/decoded on the field-by-field 
basis Fig 17 illustrates a combination of the frame-based encoding/decoding operat.ons shown in Fig.7 and 
the field-based encoding/decoding operations shown in Fig.5 and Fig.20 illustrates a combination of the 

- 5 frame-based encoding/decoding operations shown in Fig.7 and the field-based" encoding/decoding oper- 
ations Shown in Fig.6. n ,„h n 
In Fig 17 a frame made uo of oictures P6 and P7 may employ a rrame made up or pictures 12 and 13 

• as a orediction picture, as shown in Fig. 18. while picture P10 may employ picture PS. for example, as a 
oredic:ive picture. On the other hand, a frame made up cf p.ctures 34 and 35 may employ a frame mace 

■o uo of oictures 12 ano 13. a frame mace up cf pictures =6 anc ?7 anc interpolated p.ciuras or tnese Barnes, 
as shown in Fig.19. while picture P8 may employ p.ctures PS. P7. P10 and interpolated p.ctures or these 

P ' Ct Next in Fig 20 a frame made uo of oictures P6 and P7 may employ a frame made up of pictures 12 
and 13 as a orediction oicture. while a picture PiO. for example, may employ picture P6. for example as a 
orediction picture, as shown in Fig.21. On the other hand, a frame made up of pictures B4 and 35 may 
emoloy a frame made up of pictures 12 and 13. a frame mace up of pictures P6 and P7 and interpolated 
oictures of these frames, as prediction picture, as shown in Fig.22. while picture 68. for example, may 
employ pictures P6. P7. P10. Pi 1 and interpolated pictures of these pictures, as prediction p.cture. as also 

shown in Fig.22. . ^ . _ 

In this manner as shown in Figs. 17 to 22. frame-based encoding/decoding operat.ons may be 
combined with field-based encoding/decoding operations without any inconveniences. In other words, it is 
possible with the picture data encoding method and picture data decoding method accord.ng to the present 
invention to achieve efficient encoding for pictures with only small motion, pictures replete w.th motion and 
mixed pictures of these pictures, by switching from frame-based encoding/decoding operat.ons to field- 
based encoding/decoding operations and vice versa when the pictures are produced as a result of 
interlaced scanning. 

2. Picture Data Encoding Device 

Fig.23 is a block diagram showing a concrete c.rcuit construction of a picture data encoding device 
according to the present invention. 

As shown in Fig 23. the oicture data encoding device (encoder) compnses a local decod.ng circuit 
made up of an inverse quantization circuit 2 to gate 17. having the same circuit arrangement as that or a 
picture data decoding device as later explained. 

When picture data of a picture (field or frame) is entered at terminal 1. these picture data are stored 
once in buffer memory 18. Specifically, picture data are entered in the sequence of 10. B1. 82. P3. • as 
shown in Fig 24A and re-arrayed in the sequence in which the picture data are encoded by the encoder, as 
shown in Fig 248. Motion prediction as shown in Fig.l is carried out between the thus re-arrayed p.ctures. 
The- input pictures 10. 81. 82. P3. • • • are shown as fields in Figs.S and 6. while they are shown as frames 

" n F The pictures thus re-arrayed are employed for detecting a motion vector by a motion vector detection 
circuit 19 Based on the previously encoded picture(s). the motion vector detection circuit 19 detects ,he 
motion vector required for generating the picture(s) for prediction. That is. a forward picture and a backward 
picture are held by the buffer memory 18 and motion vector is detected between these p.ctures i and a 
current reference picture. For detecting the motion vector, such a motion vector is employed which has a 
minimum sum of absolute values of the differences between p.ctures on the macro-block bas.s. . 

The motion vector and the sums of the absolute values of the differences between p.ctures on the 
macro-block basis are transmitted to an encoding system decision circuit 21. By employing a'9°»*h^ 
which will be explained later, the encoding system decision circuit 21 decides whether the encoding system 
of a picture of a certain hierarchical layer is a fie.d-based encoding system or a frame-based encoding 
system. The information concerning the encoding system, that is whether the encoding system ,s a field- 
based system or a frame-based system, and the motion vector, are transmitted to motion , compensation 
circuits 12 13 and to variable length encoding circuit 25 so as to be used for controlling the burfer 
memories 7 and 8. while being transmitted to the picture data decoding device as later explained. The 
information concerning the encoding system is transmitted to a controlling circuit 16 which outputs control 
signals controlling the encoding system to gates 5. 17 and changeover switches t 6. 9. 10 and 15. 

in addition, the motion vector and the sums of the absolute values of the differences between p.ctures 
on the macro-block basis are transmitted to an intra-picture / predictive / backward / bidirecfonal pred.ct.on 
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decision circuit 20, referred to herein as a prediction decision circuit. Sased on these values, the prediction 
decision circuit 20 decides the predictive mode for the reference macro-block. 

The predictive mode is transmitted to the control circuit 156. Based on the predictive mode, the control 
circuit 16 outputs a control signal to the gates 5 and 17 for effecting switching between intra-picture / 
5 predictive / backward / bidirectional prediction. The control circuit 16 transmits the motion vector associated 
with the selected prediction mode to buffer memories 7, 8 for generating the prediction picture. More 
concretely, if the mcce is the intra-picture encoding mode, the input picture itself is generated, whereas, if 
the mode is the precictive / backward / bidirectional predictive mode, respective prediction pictures are 
generated, i he operation of the gates 5, 17 and changeover switches 6, 9, 10 and 15 is the same as that of 
to the changeover switches 36. 39. 40 and 45. so that the corresponding ooeration is omitted fcr simplicity. 

The predictive pictures from butfer memories 7, 3 are transmitted ;o a difference unit 22 generating 
difference data between the predictive picture and the current picture. The difference data are supplied to a 
discrete cosine transform (OCT) circuit 23. 

The OCT circuit 23 discrete cosine transforms the input picture data or difference data for each block 
is by taking advantage of the two-dimensional correlation of the picture signals to transmit the resulting OCT 
data to quantization circuit 24. 

The quantization circuit 24 quantizes the OCT data with quantization step size (scale) determined for 
each macro-block or fcr each slice to process the results by so-called zigzag scanning. The quantization 
circuit then transmits the resulting quantized data to a variable length coding circuit 25 and to a 
20 dequantization circuit 2. 

The quantization step size, employed for quantization, is selected to a value of not causing the failure 
(overflow) of the transmission buffer memory 26 by feeding back the residual contents of the transmission 
buffer memory 26. The quantization step size data as welt as the quantization data is supplied to VLC 
circuit 25 and to dequantization circuit 2. 
25 The VLC circuit 25 proceeds to process the information including the quantization data, quantization 

step size, predictive mode, motion vector or the encoding system by variable length encoding, appends the 
coding system information, that is the information concerning whether the encoding system is the fieid-by- 
fietd encoding system or the frame-by-frame encoding system, to a header of the predetermined hierarchi- 
cal layer, and transmits the resulting data a sth transmission data to transmission buffer memory 26. 
30 The transmission buffer memory 26 stores the transmission data temporarily in a memory and 

subsequently transmits the stored data as a bit stream at a predetermined timing. Besides, the transmission 
buffer memory 26 feeds back the macro-block based quantization control signals to the quantization circuit 
24 depending on the quantity of residual data in the memory to control the quantization step size. In this 
manner, the transmission buffer memory 26 controls the data volume gene rated as the bit stream to 
35 maintain a moderate amount of data, that is the data volume which will not produce an underflow or an 
overflow, in the memory. 

For example, if the residual data volume in buffer memory 26 is increased to an allowable upper limit, 
the transmission buffer memory increases the quantization step size of the quantization circuit 24 by 
quantization control signals for lowering the data volume of the quantized data. 

^0 Conversely, if the residual data volume in buffer memory 26 is decreased up to an allowable lower limit 

value, the data volume of the quantization data is increased by decreasing the quantization step size of the 
quantization circuit 24 by the quantization control signals. 

It will be seen from above that the bit stream from the transmission buffer memory 26, adjusted in its 
data volume, is multicfexed with encoded audio signals and synchronization signals. The multiplexed 

45 signals are added to with error correction code data and modulated in a predetermined manner before 
being recorded as pits of projections and recesses on a master disc by means of a laser beam. A stamper 
is produced from the master disc and a large number of duplicated discs, for example, picture recording 
media, such as opticai discs, from the stamper. 8esides, the bit stream is transmitted over a transmission 
channel to a picture data decoding unit as later explained. Meanwhile, the picture recording medium is not 

so limited to the opticai disc, but may for example be a magnetic tape. 

On the other hand, output data from the inverse DCT circuit 3 is added to with the predictive picture by 
addition circuit 4 to effect local decoding. The operation of the local decoding is the same as that of a 
picture decoding device as later explained and hence the corresponding explanation is not made for 
simplicity. 

55 lt is noted that the picture constructing pattern is not limited to that shown in rig. 3 and differs with 

different encoding sequences. Concretely, there are a number of variations in the encoding processing 
sequence of P-ptctures and B-pictures temporally sandwiched between the P-ptctures. However, these may 
be dealt with by simply changing the processing sequence by changing the control for buffer memories 7. 
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8. 

The concrete algorithm for a encoding system decision circuit 21 is hereinafter explained by referring to 

a flow chart shown in Fig.25. 

The encoding system decision circuit 21 selects the encoding system by taking advantage of the 
motion vector from the. first frame, e.g. an odd field, to the second field, such as an even field. 

Concretely, encoding system decision circuit 21 finds the motion vector at step ST1 for all of the 
macro-blocks in a frame to be encoded before proceeding to step ST2. 

At step ST2, encoding system decision circuit 21 finds a median value of each of the horizontal 
components (x) and vertical components (y) of the motion vector, before proceeding to steo ST3. 
Concretely, the median value of the Horizontal comoonents is founa in the rollowing manner. .-;rst. ihe 
horizontal components cf -he motion vector are arrayed in the oraer :f ene railing powers, arc the vaiue of 
the center data is adopted as a median Mv x of the horizontal components. The median value Mv, is round 
in the similar manner. 

At step ST3 encoding system decision circuit 21 finds the size R of the vector MV found at step 5T2 as 
15 a parameter indicating the magnitude of the motion of the entire screen, because the vector MV is the 
parameter indicating the magnitude of the entire screen. The decision circuit then proceeds to steo ST4. 
The size R is found by the following formula 1 
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R = |MV(j = sqrt(Mv x 2 - Mv y 2 ) (1) 

At step ST4 t encoding system decision circuit 21 causes the encoding system to be switched by the 
size R as found at step ST3. Since the field-by-field encoding system (encoding operation) is more 
favorable with pictures with fast motion, and the frame-by-frame encoding system (encoding operation) is 
more favorable with pictures with little motion, the encoding system decision circuit 21 proceeds :o step 
S 75 if tri e size R is not more than a predetermined threshold value TH to select the frame-based encoding 
system. Otherwise, the decision circuit proceeds to step ST6 to select the field-based encoding system. 

It will be seen from above that the picture data encoding device according to the present invention 
decides whether the encoding system is to be performed on the field-by-field basis or on the frame-by- 
frame basis, depending on the motion occurring in the picture, more specifically, on the motion vector 
between the first and second fields of the same frame or on the size of the median value of the motion 
vector, so that efficient encoding may be achieved not only with the picture having abundant motion or little 
motion but also with the picture having both abundant motion and little motion. 

3. Picture Oata Format 



Concrete examples of the bit stream outputted from the present picture data encoding device, that is of 
the picture data format, are hereinafter explained. 

Rg.26A shows an example in which a discrimination data for discriminating whether the encoding has 
been performed on the frame-by-frame basis or on the field-by-field basis (the information concerning the 
*o encoding system) is appended as flag, as shown in Fig.26A. In the video sequence layer, star, code, 
horizontal size, vertical size etc. are recorded with a predetermined number of bits, indicated by numerals in 
the figures, as shown in Fig.26A. In the present embodiment, the discrimination data for discriminating 
whether the encoding has been performed on the field-by-field basis or on the frame-by-frame basis, is 
appended after the frame frequency as a 1-bit flag. The flag may also be appended at some other location. 
45 The above flag may also be added to a header of the GOP layer, as shown in Rg.268. In ihe present 

instance, discrimination data are added following the time code as a l-bit flag. The flag may also be added 
at any other location. 

Fig 26 shows an instance in which the flag is added to the MPEG standard. Such flag may also be 
written in extension regions of each layer as prescribed by MPEG. For the video sequence layer, for 
50 example, it may be written in the extension region (sequence extension bytes) or in user data. Concretely, 
the flag may be added to the extension region of the picture layer according to the MPEG standard, as 
shown in Fig.26C. 2-bit flags are used for indicating the encoding system. The 2-bit flags denote the 
following contents: 

00: A picture encoded on the frame-by-frame basis. 
55 01: A first field of a picture encoded on the field-by-field basis. 

10: A second field of a picture encoded on the fteld-by-field basis. 
1 1 : Spare. 
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4. Picture Data Encoding Device 

Fig. 27 is a block diagram showing a concrete circuit arrangement of a picture data decoder according 
to the present invention. 

5 An inverse variable length coding circuit (inverse VLC circuit) 31 encodes the bit stream from the 

picture data encoder shown in Fig. 23 or the bit stream obtained on reproducing a picture recording 
medium, such as an optical disc, by inverse variable length encoding, and outputs the resulting data to a 
dequantization circuit 32. Simultaneously, the inverse VLC circuit 31 deciphers data written by superposition 
on the picture data during encoding, such as motion vector, quantization width (quantization step size) or 

to encoding information. 

Above all. inverse VLC circuit 31 deciphers the flags appended to headers of the video sequence layer. 
GOP layer or the picture layer to obtain the information as to whether the decoding is to be made on the 
frame-by-frame basis or on the field-by-field basis. Such information is supplied to a control circuit 46 which 
then generates control signals for frame-based processing or field-based processing. Specifically, the 

75 control circuit 46 is responsive to the output of inverse VLC circuit 31 to generate a variety of control 
signals for changing over gates 35 and 47 and fixed contacts of changeover switches 36. 39. 40 and 45. 

That is, if, in the case of frame-based processing, each picture is made up of 480 lines each consisting 
of 720 pixels, and each macro-block consists of 16x16 pixels, picture processing is completed by 1350 
macro-blocks, and a corresponding control signal is generated for processing the next picture. On the other 

20 hand, in the case of fieid-based processing, picture processing is completed by 675 macro-biocks and a 
corresponding control signal is generated for processing the next picture. Besides, frame-based or fieid- 
based decoding is carried out by controlling the scheduling of buffer memories 37, 38. 

On the other hand, the control signal indicating if the decoding is to be frame-based or field-based, 
produced by the inverse VLC circuit 31, is also supplied to motion compensation circuits 42, 43, and a 

25 frame-based or field-based decoding is carried out by the motion compensation circuits 42. 43 controlling 
addresses of the buffer memories 37. 38, as discussed in detail hereinbelow. 

The dequantization circuit 32 inversely scans and dequantizes data processed by inverse VLC to output 
the resulting data to an inverse OCT circuit 33. The inverse OCT circuit 33 inversely discrete cosine 
transforms input data to output the resulting data to an additive node 34. The additive node 34 is supplied 

30 via gate 47 with predictive picture data as selected by switching of the changeover switch 45. The 
predictive picture data is summed to output data of the inverse OCT circuit 33 to generate decoded picture 
data. 

if an output of the additive node 34 is an l-picture or a P-picture. gate 35 is opened, and decoded 
picture data are supplied via changeover switch 36 to buffer memories 37 or 38 so as to be stored therein. 

35 Specifically, if the output of the additive node 34 is the l-picture or the P-picture, changeover switches 

39, 40 are changed over to fixed contacts a. The changeover switch 36 is alternately changed over to 
contacts a and b so that the picture (l-picture or the P-picture) outputted from the additive node 34 is 
alternately stored in the buffer memories 37, 38. 

If-picture data arrayed in the sequence of pictures 10, B1. 82, P3, B4, B5, PS, 87, B8. P9. as shown at 

40 A in Fig. 24. are processed by picture data encoder in the sequence of pictures !0, P3, 81, B2. P6, 84. 35, 
P9, B7, 88, as shown at S in Fig. 24, the picture data are entered to inverse VLC circuit 31 in the latter 
sequence of pictures, as shown at D in Fig. 24. 

As a result thereof, if decoded data of picture 10 are stored in buffer memory 37, as shown at E in 
Fig. 24, decoded data of picture P3 are stored in buffer memory 38, as shown at F in Fig. 24. Besides, data 

45 of picture 10 of buffer memory 37 are updated to data of picture PS, while data of picture P3 of buffer 
memory 38 are updated to data of picture P9, as shown at E and F in Fig.24. 

When data of pictures 81 or 32 are entered to additive node 34 from inverse DCT circuit 33 in the 
wake of pictures 10 and P3, data of picture 10 stored in buffer memory 37 are motion-compensated in 
motion compensation circuit 42 in accordance with the motion vector before being supplied to an 

so interpolating circuit 44. On the other hand, data of picture P3 stored in buffer memory 38 are motion- 
compensated in motion compensation circuit 43 in accordance with the motion vector before being supplied 
to interpolating circuit 44. The interpolating circuit 44 combines input data from the motion compensation 
circuits 42. 43 at a predetermined ratio in accordance with data entered from the inverse VLC circuit 31 . 
The combined data is selected by changeover switch 45 so as to be supplied via its contact b and gate 47 

55 to additive node 34. The additive node 34 sums data from inverse VLC circuit 33 and data as selected by 
changeover circuit 45 for decoding the pictures 81 or 82. 

If the pictures B1 or 82 are decoded only from the previous picture 10, changeover switch 45 is 
changed over to its contact a. whereas, if the pictures 81 or B2 are decoded only from the posterior picture 
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P3. changeover switch 45 ,s changed over to its contact c. In either cases, data of the pictures .0 or P3 are 
supplied to additive node 34. rpversa nhase to changeover switch 36. 

me changeover ^»^ i f^rir^.rr«S^ - 3 9 is changed 

. That is. i, Changeover sw.icn 36 ,s Changed over to memory 37. 

s over 10 its con.ac. b or a. respective,,, ma result ,s that. .1. . arte p.clura 0 ^ 
Changeover switch "36 i! changed over to ,tt comae, b and P«>^ P = * s, ° ,„ e „ changed ov „ 

. changeover switch 39 is changed over to ,1s contact Z™"™*"??^ £„„„ „ ?„ Rg . 24 . so as to be 
,0 it, contact a. and the picture ,0 is read out trorn our,., = ory3 ^^"^ wt .,„ 
suoolied v,a changeover switches o-. -0 „ . oispw „ s . yi h 40 js a ,, eao ,, cha „c.d over to 

, p.ctures 31 and ™«*^y;°~::Z£«™«^ «• « — « • * 
its contact o. so .ha. ,c ur s B, no =2 » ^ ^ ^ ^, lch M is 

^^^"^tr^ P3 atread, stored inbufter memory 33 is read out and suppi.eo . 

T™ Circuit arrangement o, the inverse *SS^ >•<>.- shown 

The .nve.se VLC circuit 31 includes a barre ** 3 a .nd . co™ oec p e 9 ^ ^ ^ 

in Fig.28. input code data .re supp.ied by barre, ° ™ °* %o£S. or matching circuitry, no, 

,h. above-mentioned contro, I', as a shit, puan.it, ,o be shifted 

n.^rT ^1," ^ou^ « ne^od. data on ,h. ,e- a,d 3,b„ b.s.s to the code 

«n^n in m. co*t*l. as m. , o ^ presen , 

execute decoding .. based on the tie, suppiied irom 

„ R9 1n the case o, a prcture data decoding device TlZZfSTjSZ 

"""C f s 9 'S,h me ccure da.a oecoding device -or decoding on,» lieid-based picture data, decision 
« circuit 5, enters a .lag ,, step ST, . beiore "^g^^, , „am.-b,sed processing. » the 

'"a, step ST3. decision circuit S. causes me dispiav unit 52 to dispiay mat decoding ,s impossible, 
betore '^^^^ 51 caus „ „,„ a , sp ,ay unit 52 to d,sp,.y ma. decoding is possibie. be,ore 

decision Circuit 5, en.ers a (lag a. step ST, ^^J!" ™J f„. M . b ased processing. ,1 me result 

A, step ST2. decision circuit 51 ch.c k s,o , Uhe teg^ca s a ^ ^ 

- ^r^ss^r™^ sruni', « » «^ ,« - 

before terminating the program. 
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1. A picture data encoding method for encoding interlaced picture signals, comprising deciding if 
encoding of said picture signals is to be executed on the field-by-field basis or on the frame-by-frame 
basis, 

encoding said picture signals on the field-by-field basis or on the frame-by-frame basis, depending 
on the results of decision, fcr generating encoded picture data, and 

appending discrimination data to said encoded picture data based on the results of decision, said 
discrimination data indicating if said encoded picture data have been encoded on the field-by-field 
basis or on the frame-by-frame basis. 

2. The picture data encoding method as claimed in claim 1 wherein said encoded picture data has a 
hierarchical structure and wherein said discriminating data is provided at a portion of a header of a 
predetermined hierarchical layer of said encoded picture data. 

3. The picture data encoding method as claimed in claim 1 wherein the decision is executed based on a 
motion vector of each macro-block. 

4. A picture data encocing method for encoding interlaced picture signals, comprising 
encoding said picture signals on the field-by-field basis for generating encoded picture data, and 

appending to said encoded picture data discrimination data indicating that said encoded picture data 
have been encoded on the field-by-field basis. 

^ 5. The picture data encoding method as claimed in claim 4 wherein said encoded picture data has a . 
hierarchical structure and wherein said discriminating data is provided at a portion of a header of a 
predetermined hierarchical layer of said encoded picture data. 

6. A picture data encoding method for encoding interlaced picture signals, comprising 
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^encoding said picture signals on the frame-by-frame basis for generating encoded picture data, and 
appending discrimination data indicating that said encoded picture data have been encoded on the 
frame-by-frame basis to said encoded picture data. 

7. The picture data encoding method as claimed in claim 6 wherein said encoded picture data has a 
hierarchical structure and wherein said discriminating data is provided at a portion of a header of a 
predetermined hierarchical layer of said encoded picture data. 

8. A picture data decoding method for decoding picture data encoded with a block consisting in a two- 
dimensional array of plural pixels as an encoding unit, comprising 

detecting a flag provided at a portion of a header for said picture data for discriminating if the 
picture data have been encoded on the field-by-fieid basis or on the frame-by-frame basis, and 

adaptively switching between field-based decoding and frame-based decoding based on said flag. 

9. The picture data decoding method as claimed in claim 8 wherein said picture data has a hierarchical 
structure, said flag is detected from a header of a predetermined hierarchical layer of said picture data, 
and wherein switching is adaptively made between field-based decoding and frame-based decoding for 
each of predetermined hierarchical layers based on said flag. 
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10. A picture data decoding method for decoding picture data encoded with a block consisting in a r.vo- 
dimensional array of plural pixels as an encoding unit, comprising 

detecting a flag provided at a portion of a header for said picture data for discriminating if -.he 
picture data have been encoded on the field-by-field basis, and 

decoding the picture data based on said flag on the field-by-field basis. 

11. The picture data decoding method as claimed in claim 10 wherein 
said oicture data has a hierarchical structure, and wherein 
said flag is detected from a header cf a predetermined hierarchical layer. 

12. A picture data decoding method for decoding picture data encoded with a block consisting in a two- 
dimensional array of piural pixels as an encoding unit, comprising . . „. •■ . K 

detecting a flag provided at a portion of a header for said picture data for discnm.nating n the 
picture data have been encoded on the frame-by-frame basis, and 

decoding the picture data based on said flag on the frame-by-frame basis. 

13. The picture data decoding method as claimed in claim 12 wherein 
said picture data has a hierarchical structure, and wherein 
said flag is detected from a header of a predetermined hierarchical layer. 

14. A picture data encoding device for encoding picture signals with a block consisting in a two- 
dimensional array of plural pixels as an encoding unit, comprising 

decision means for deciding if encoding of said picture signals is to be executed on the helo-by- 
field basis or on the frame-by-frame basis, 
25 encoding means for encoding said picture signals on the field-by-field basis or on the frame-by- 

frame basis, depending on the results of decision by said decision means, for generating encoced 

P,CtU a poending means for appending discrimination data to said encoded picture data based on the 
results' of decision. sa,c discrimination data indicating if said encoded picture data have been encoded 
30 on the field-by-field basis or on the frame-by-frame basis. 

15. The picture data encoding device as claimed in claim 14 wherein said appending means include a 
variable length encoding circuit. 

16. The picture data encoding device as claimed in claim 14 further comprising motion vector detecting 
means for detecting a motion vector between first and second fields of a current frame to be encoded, 
said decision means executing said decision based on the motion vector from said motion vector 
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A picture data encoding device for executing block-based encoding of picture signals consisting in an 
interlaced two-dimensional array of pixels for generating encoded picture data having a h.erarcnical 

structure, comorising r~t~~ 

motion vector detecting means for detecting the motion vector between first and second f.e.cs of a 
current frame of said picture signals to be encoded, from one macro-block to another. 

decision means for deciding if encoding of said picture signals is to be executed on the „e.d-by- 
field basis or on the frame-by-frame basis, based on the magnitude of a median of the motion vector 
from said motion vector detecting means, as found over a predetermined hierarchical layer 

encoding means for encoding said picture signals on the field-by-field basis or on the frame-oy- 
frame basis, based on the results of decision from said decision means, and 

variable length coding means for variable length encoding said encoded picture da ta -nd dis- 
criminating data indicating that said encoded picture data have been encoded on the held-by-he.d basis 
or on the frame-by-frame basis, based on the results of decision from sa.d dec.s.on means =nd for 
causing said discriminating data to be appended to a header of a predetermined hierarchical layer of 
said picture data. 

18. A picture data decoding device for decoding picture data encoded with a block consisting of a two- 
dimensional array of plural pixels as an encoding unit, comprising 

detecting means for detecting a flag provided at a portion of a predetermined hierarchical layer of 
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said picture data for indicating whether the picture contained in said hierarchical layer has been 
encoded on the field-by-field basis or on the frame-by -frame basis, and 

decision means for deciding, based on said flag from said detecting means, whether or not 
decoding of said picture data is possible. 

The picture data decoding device as claimed in claim 18 further comprising display means for 
displaying the results of decision from said decision means. 

The picture data decoding device as claimed in claim 18 further comprising an inverse variable ienaih 
coding circuit, said r'tag being discriminated by said inverse variable length coding circuit. 

A picture data decoding device for decoding picture data encoded with a block consisting in a two- 
dimensional array of plural pixels as an encoding unit, comprising 

detecting means for detecting a flag provided at a portion of a predetermined hierarchical layer of 
said picture data for indicating whether the picture contained in said hierarchical layer has been 
encoded on the field-by-field basis or on the frame-by-frame basis, and 

decoding means for decoding said picture data by selecting, for each of hierarchical layers, one of 
field-based decoding or frame-based decoding depending on the flag from said detecting means. 

The picture data decoding device as claimed in claim 21 further comprising an inverse variable length 
coding circuit, said flag being discriminated by said inverse variable length coding circuit. 

A picture recording medium having recorded thereon encoded interlaced picture data encoded on the 
field-by-field basis or on the frame-by-frame basis in a hierarchical layer, and discriminating data 
provided at a header of a predetermined one of .said hierarchical layers for indicating whether the 
encoded picture data have been encoded on the field-by-field basis or on the frame-by-frame basis. 
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